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(54) Apparatus and method for motion vector estimation with high speed 



(57) In a motion vector estimating apparatus, a cur- 
rent picture storage unit (1) stores image data of a cur- 
rent picture, and a reference picture storage unit (2) 
stores image data of a reference picture. A search win- 
dow determining unit (4) determines estimation history 
from previously estimated motion vectors, and deter- 
mines a search window based on the estimation history. 
At least one of a shape, size and position of the search 



window is determined based on the estimation history. 
The search window is composed of rectangular refer- 
ence regions. A block matching circuit (300-1 , 300-2) for 
performing a block matching process to a current block 
and each of reference blocks of the search window to 
determine a motion vector. The search window may be 
limited in units of pixels, or a load of the apparatus, a 
power supply voltage or a temperature of the block 
matching circuit. 
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Description 

Background of th Invention 

s 1 , Field of the Invention 

The present invention is related to an apparatus and a method for estimating a motion vector utilized to compensate 
for motion of an image in a prediction coding process of a moving picture. 

10 2. Description of the Related Art 

In a conventional a method and apparatus for estimating a motion vector estimating method, a current picture to 
which a prediction coding process is performed is divided into a plurality of blocks, for example, to have 16 pixels in a 
horizontal direction and 1 6 pixels in a vertical direction. These blocks are referred to as current blocks. A search region 
is is predetermined in a reference picture for each of the above-described current blocks to have as a center of the search 
region the positions corresponding to the horizontal position and the vertical position of a pixel position of each current 
block. The predetermined search region in the reference picture is referred to as a search window. A block matching 
process is carried out between a specific one of the current blocks and the search window corresponding to the specific 
current block to estimate a position on which a predetermined evaluation function has the minimum value for obtaining 
20 a motion vector. The conventional motion vector estimating apparatus is directed to a purpose that the motion vector 
can be more correctly obtained. 

A first conventional example of a motion vector estimating apparatus is disclosed in, for example, Japanese Laid 
Open Patent Application (JP-A-Showa 61 -201 583). In the reference, a limit value is provided to limit a reference region 
used to search the motion vector in accordance with a change amount of a motion vector detected in a previous frame 
25 to a current frame. When the motion vector change amount exceeds the limit value, this limit value is outputted. 

Also, a second conventional example of the motion vector estimating apparatus is proposed in Japanese Laid 
Open Patent Application (JP-A-Heisei 5-328333), in which several search windows or regions for a motion vector are 
prepared to be different from each other in position. One of the several reference windows or regions is selected in 
accordance with the change amounts of the motion vectors detected in a previous frame to the current frame. 
30 Further, a third conventional example of the motion vector estimating apparatus is proposed in Japanese Laid 

Open Patent Application (JP-A-Heisei 7-203457), in which an accumulating process of absolute values of differences 
is executed in a parallel manner in the block matching process. As a result, the capacity of a buffer for signals used in 
the accumulation is reduced so as. to increase the processing speed. 

Furthermore, a fourth conventional example of the motion vector estimating apparatus is proposed in Japanese 
35 Laid Open Patent Application (JP-A-Heisei 7-250328), in which motion vectors used to compensate for the moving 
image prediction are estimated in accordance with a plurality of prediction modes with a small hardware amount. 

There are the following problems in the above-described conventional examples of the motion vector .estimating 
techniques. 

(1 ) The first problem is in the following point. That is, power consumption cannot be reduced in the first conventional 
40 example of the motion vector estimating apparatus in which the limit value is provided to limit the search region for the 

motion vector. This is because in the motion vector estimating apparatus described in Japanese Laid Open Patent 
Application (JP-A-Showa 61-201583), the limit value is only introduced to the reference region. The calculation time 
and the calculation amount for the block matching process to be executed cannot be reduced in actual. In other words, 
since the calculation time and the calculation amount cannot be reduced, the power consumption of the motion vector 

45 estimating apparatus cannot be reduced. 

In the block matching process as described in, for instance, Japanese Laid Open Patent Application (JP-A-Heisei 
7-203457) and Japanese Laid Open Patent Application (JP-A-Heisei 7-250328), the operation time is reduced by 
employing parallel processing. However, the reason why the operation time cannot be reduced is that if the limit value 
is provided as in the motion vector estimating apparatus described in Japanese Laid Open Patent Application (JP-A- 

50 Showa 61 -201 583), the parallel degree of the above-described parallel processing decreases. As a result, the operation 
time cannot be reduced. 

(2) The second problem is in that the correct motion vector cannot be obtained in the conventional motion vector 
estimating apparatus as proposed in Japanese Laid Open Patent Application (JP-A-Heisei 5-328333), in which the 
several reference regions for the motion vector are prepared. This is because in the method for selecting the optimum 

55 reference region from the prepared reference regions, when pictures are such complex that more than one object 
move, the object moving direction cannot be exclusively determined. As a result, the optimum reference region cannot 
be selected from the prepared several reference regions. 

Also, in the above-described motion vector estimating apparatus described in Japanese Laid Open Patent Appli- 
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cation (JP-A-Heisei 5-328333), if the size and/or shape of the reference region cannot be previously determined then 
the parallel degree of the apparatus for executing the block matching process would be lowered. As a consequence 
since the shape of the reference region is limited and thus the proper reference region cannot be utilized the correct 
motion vector cannot be obtained. 

(3) The third problem is in that in the motion vector estimating apparatus proposed in Japanese Laid Open Patent 
Application (JP-A-Heisei 7-203457), the difference absolute value accumulating process is carried out in the parallel 
manner using the small buffer for signals for the calculation. However, in the motion vector estimating apparatus the 
hardware amount of the block matching circuit for executing the difference absolute value accumulating process in the 
parallel manner increases so that the chip area for the block matching circuit is increased as well as the power con- 
sumption is increased. This is because in the motion vector estimating apparatus as described in Japanese Laid Open 
Patent Application (JP-A-Heisei 7-203457) and Japanese Laid Open Paten Application (JP-A-Heisei 7-250328) a large 
amount of wiring lines must be provided for a processing element as the minimum unit of the block matching circuit 
Thus, the valid data can be continuously entered into the respective processing elements such that the parallel degree 
is increased. As a consequence, a chip area required for the wiring lines increases considerably. 

Also, in the above-explained motion vector estimating apparatus described in Japanese Laid Open Patent Appli- 
cation (JP-A-Heise. 7-203457) and Japanese Laid Open Patent Application (JP-A-Heisei 7-250328) there is another 
problem. That is, the m.nimum value determining unit determines the minimum value of a summation of absolute values 
of differences accumulated values in the respective processing elements, and the chip area required for wiring lines 
is considerably increased, because a total number of wiring lines is very large which are used to be connected from 
the minimum value determining unit. 

(4) The fourth problem is in that if the size and/or shape of the reference region is allowed to be varied in the motion 
vector estimating apparatus in which the difference absolute value accumulating process is executed in the parallel 
manner, the hardware amount of the block matching circuit for executing the difference absolute value accumulation 
process m the parallel manner is increased. As a result, the chip area increases and also the power consumption 
increases. Also, at this time, the parallel degree is reduced. This is because the large amount of wiring lines must be 
Provided to continuously enter the valid data into the processing elements as the minimum unit for constituting the 
block matching circuit. , 

Also it is because the data entering sequence of the processing element is greatly changed, depending upon the 
sizes and the shapes of the reference regions. As a result, the process for reading out the reference data from a storage 
unit becomes complex, so that the chip area increases and the power consumption increases 

(5) The fifth problem is in that the chip area of the block matching circuit increases in the motion vector estimating 
apparatus in which the motion vector is estimated in a plurality of prediction modes. This is because a large number 
of w.r.ng lines is required to connect the minimum value determining unit for determining the minimum value of the 
difference absolute value accumulated values to the respective processing elements in the motion vector estimating 
apparatus described in Japanese Laid Open Patent Application (JP-A-Heisei 7-250328). As a result the chip area 
required for the wiring lines is considerably increases. 

Also, the stage number of FIFOs (first-in first-out) required for the block matching circuit is increased. 

Summary of the Invention 

As a consequence, the present invention has been made to solve the above-described problems Therefore an 
object of the present invention is to provide a motion vector estimating method and a motion vector estimating apparatus 
which can estimate a motion vector in high precision with low power consumption and a saved chip area 

Another object of the present invention is to provide a motion vector estimating method and a motion vector esti- 
mating apparatus, in which when a parallel processing is applied to a block matching process, the parallel degree 
thereof is not reduced. a 

Still another object of the present invention is to provide a motion vector estimating method and a motion vector 
est.mat.ng apparatus, in which motion vectors of a plurality of prediction modes can be estimated while saving the chip 
area of a block matching circuit. 

. Ye ' sti " anotner ob f ect of ,ne P resen t invention is provide a motion vector estimating apparatus, which can be 
realized with the saved chip area of a block matching circuit regardless of the dimension and shape of a reference 

region. 

In order to achieve an aspect of the present invention, a motion vector estimating apparatus includes a current 
picture storage unit which stores image data of a current picture, a reference picture storage unit which stores image 
data of a reference p.cture, a search window determining unit which determines estimation history from previously 
estimated motion vectors, and determines a search window based on the estimation history, wherein the search window 
is composed of rectangular reference regions, a block matching circuit for performing a block matching process to a 
current block and each of reference blocks of the search window to determine a motion vector, and a control circuit 
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which reads out the image data of the current block from the current picture storage unit to supply the block matching 
circuit, which reads out the image data of each of the reference blocks of the search window from the reference picture 
storage unit to supply to the block matching circuit, and which controls the block matching circuit such that the block 
matching process is performed to the current block and the each reference block of the search window. 

5 At least one of a shape of the search window, a size of the search window and a position of the search window is 

determined based on the estimation history. 

The search window determining unit determines whether the number of previously estimated motion vectors ex- 
ceeds a predetermined number. When it is determined that the number of previously estimated motion vectors is less 
than the predetermined number, the search window determining unit neglects the previously estimated motion vectors 

10 and determines optimal size and shape of the search window. On the other hand, when it is determined that the number 
of previously estimated motion vectors exceeds the predetermined number, the search window determining unit pro- 
duces a 2-dimensional map in which each of the previously estimated motion vectors is mapped based on its horizontal 
value and the vertical value. Thus, the search window determining unit determines optimal size and shape of the search 
window such that the search window includes the history of the previously estimated motion vectors. Also, the search 

is window determining unit calculates average values of changes of the previously estimated motion vectors in the hor- 
izontal and vertical directions, respectively, and determines an optimal position of the search window such that, for 
example, a center position of the search window is shifted based on the average values. In this manner, the size, shape 
and position may be independently determined. On the contrary, the size, shape and position may be collectively 
determined. 

20 The motion vector estimating apparatus may further include a load detecting circuit for detecting either one of a 

load of the motion vector estimating apparatus or a load of the block matching circuit. In this case, the search window 
determining unit determines the search window based on the estimation history and the detected load. Also, the motion 
vector estimating apparatus may further include a voltage detecting circuit for detecting a power supply voltage. In this 
case, the search window determining unit determines the search window based on the estimation history and the 

25 detected power supply voltage. Further, the motion vector estimating apparatus may further include a temperature 
detecting circuit for detecting a temperature of a chip on which the block matching circuit is installed. In this case, the 
search window determining unit determines the search window based on the estimation history and the detected tem- 
perature. In addition, the search window determining unit may determine at least an unuseful region in the search 
window in unit of pixels based on the estimation history, and set a flag to each of pixels of the unuseful region such 

30 that the block matching process to the reference block which includes any pixel of the unuseful region is made invalid 
based on the flag of the pixel of the unuseful region. 

The block matching circuit includes processing elements of M columns and N rows (M and N are positive integers) 
provided such that a likelihood between the current block and each of the reference blocks is calculated through pipeline 
processing. In this case, the block matching circuit may further include a hierarchical selector structure such that each 

35 of the processing elements is supplied with one of the pixel data of each of the reference blocks. Also, the block 
matching circuit may further include (M + N) reference data buses, which are provided for the M columns and the N 
rows, and N current data buses provided for the N rows, respectively In this case, each of the (M + N) reference data 
buses transfers one pixel data of any of the reference blocks in a unit time of the pipeline processing. The current block 
is composed of M columns and N rows. 

40 The processing elements may be cascade-connected in each of the N rows and the N rows may be cascade- 

connected. 

In another case, the processing elements may be cascade-connected in each of the N rows (N is an even number). 
Odd-numbered ones of the N rows in an upper half are cascade-connected via first FIFO units, each of which is provided 
for two adjacent odd-numbered rows, wherein even-numbered ones of the N rows in the upper half are cascade- 
's connected via second FIFO units, each of which is provided for two adjacent even-numbered rows, wherein odd- 
numbered ones of the N rows in a lower half are cascade-connected via third FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and wherein even-numbered ones of the N rows in the lower half are cascade- 
connected via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows. Further, the block 
matching circuit further includes fifth and sixth FIFO units connected to last stages of processing elements in the 
so cascade connections for the odd-numbered rows and the even-numbered rows in the upper half, respectively. In this 
case, a number of stages in each of the first to fourth FIFO units is M, and a number of stages in the fifth FIFO unit is 
(M x N/2 + M), and a number of stages in the sixth FIFO unit is (M x N/2). 

In order to achieve another aspect of the present invention, a motion vector estimating apparatus includes process- 
ing elements of M columns and N rows (M and N are positive integers) , wherein the processing elements are cascade- 
55 connected in each of the N rows, wherein each of the processing elements calculates an absolute value of a difference 
between a corresponding pixel data of a current block and one pixel data of each of reference blocks, and adds the 
absolute value to an output from a previous stage of processing element to output the adding result to a next stage of 
processing element, and wherein the N rows are cascade-connected to allow pipeline processing, M reference data 
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column buses provided for the M columns, respectively, N reference data row buses provided for the N rows, respec- 
tively, (M x N) selectors, which are respectively provided for the processing elements to allow one of the pixel data on 
the corresponding one of the M reference data column buses and the pixel data on the corresponding one of the N 
reference data row buses to the corresponding processing element as the reference block pixel data, and a current 

5 block pixel data bus for transferring current block pixel data to each of the processing elements. 

The motion vector estimating apparatus may further includes a reference block data bus group for transferring 
pixel data of the reference blocks in units of rows as sequences of pixel data, a first group of selectors which are 
respectively provided for the M reference data column buses, and each of which selects one pixel data of one of the 
sequences to supply to the corresponding reference data column bus : and a second group of selectors which are 

io respectively provided for the N reference data row buses, and each of which selects one pixel data of one of the 
sequences to supply to the corresponding reference data row bus. In this case, each of the processing elements latches 
the corresponding current block pixel data once the corresponding current block pixel data is supplied via the current 
block pixel data bus. Also : each of the processing elements is synchronized with a clock signal to perform the calculation 
of the absolute value of the difference and the addition and to output the addition result to the next stage of processing 

'5 element. Thus, the processing elements calculates a likelihood of one of the reference blocks in a time period from a 
first time to (M x N) time, and calculates a likelihood of a different one of the reference blocks in a time period from a 
■second time to (M x N + 1) time. 

The current block is composed of pixels of M columns and N rows. 

In a case, the processing elements are cascade-connected in each of the N rows and the N rows are cascade- 
20 connected. 

In another case, the processing elements are cascade-connected in each of the N rows (N is an even number). 
In addition, odd-numbered ones of the N rows in an upper half are cascade-connected via first FIFO units, each of 
which is provided for two adjacent odd-numbered rows, even-numbered ones of the N rows in the upper half are 
cascade-connected via second FIFO units, each of which is provided for two adjacent even-numbered rows, odd- 
numbered ones of the N rows in a lower half are cascade-connected via third FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and even-numbered ones of the N rows in the lower half are cascade-connected 
via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows. Also, the block matching circuit 
further includes fifth and sixth FIFO units connected to last stages of processing elements in the cascade connections 
for the odd-numbered rows and the even-numbered rows in the upper half, respectively. In this case, a number of 
stages in each of the first to fourth FIFO units is M 5 and a number of stages in the fifth FIFO unit is (M x N/2 + M), and 
a number of stages in the sixth FIFO unit is (M x N/2). The motion vector estimating apparatus may further include a 
fist adder adding an output of the fifth FIFO unit and an output of the sixth FIFO unit, a second adder adding the output 
of the fifth FIFO unit and an output of the processing element in a last stage of odd-numbered cascade connection in 
the lower half, a third adder adding the output of the sixth FIFO unit and an output of the processing element in a last 
stage of even-numbered cascade connection in the lower half, a fourth adder adding the output of the processing 
element in the last stage of odd-numbered cascade connection in the lower half and the output of the processing 
element in the last stage of even-numbered cascade connection in the lower half, and a fifth adder adding an output 
of the second adder and an output of the third adder. * 

In still another case, the processing elements are cascade-connected in each of the N rows (N is an even number). 
Also, ones of the N rows in an upper half are cascade-connected via first FIFO units, each of which is provided for two 
adjacent rows, and ones of the N rows in a lower half are cascade-connected via second FIFO units, each of which is 
provided for two adjacent rows. In addition, the block matching circuit further includes a third FIFO units connected to 
the processing element in a last stage of cascade connection in the upper half. In this case, a number of stages in 
each of the first and second FIFO units is M, and a number of stages in the third FIFO unit is (M x N/2). The motion 
vector estimating apparatus may further include an adder adding an output of the third FIFO unit and an output of the 
processing element in a last stage of cascade connection in the lower half. 

In yet still another case, the processing elements are cascade-connected in each of the N rows (N is an even 
number). Also, odd-numbered ones of the N rows are cascade-connected via first FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and even-numbered ones of the N rows are cascade-connected via second FIFO 
so units, each of which is provided for two adjacent even-numbered rows. In this case, a number of stages in each of the 
first and second FIFO units is M. The motion vector estimating apparatus may further include an adder adding an 
output of the processing element in a last stage of odd-numbered cascade connection and an output of the processing 
element in a last stage of even-numbered cascade connection. 

In order to achieve still another aspect of the present invention, a method of estimating a motion vector, includes 
55 the steps of: 

determining estimation history from previously estimated motion vectors; 

determining a search window based on the estimation history, wherein the search window is composed of rectan- 
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guiar reference regions: and 

performing a block matching process to a current block and each of reference blocks of the search window to 
determine a motion vector. 

s Bri f Description of the Drawings 

Fig. 1 is a block diagram illustrating of the structure of a motion vector estimating apparatus according to a first 
embodiment of the present invention; 

Fig. 2 is a block diagram illustrating the structure of a block matching unit in the motion vector estimating apparatus 
10 in the first embodiment shown in Fig. 1; 

Fig. 3 is a connection diagram for explaining connections of a processing element in the block matching unit in 
the first embodiment shown in Fig. 2; 

Fig. 4 is a block diagram illustrating the internal structure of a processing element shown in Fig. 3; 

Fig. 5 is a block diagram illustrating the structure of a motion vector monitoring unit of the motion vector estimating 
is apparatus in the first embodiment shown in Fig. 1; 

Fig. 6A is a diagram for explaining a search window which is formed in such a manner that a plurality of rectangular 

shapes are synthesize, in the first embodiment of the present invention; 

Fig. 6B is a partial expanded diagram for explaining the search window shown in Fig. 6A; 
■ Fig. 7 is a diagram for explaining the relation of a previous picture, a search window including reference blocks 
20 and a current block of a current picture; 

Figs. 8A and 8B are schematic diagrams for explaining a block matching process operation of the block matching 

unit in the first embodiment of the present invention; 

Figs. 9A to 9F are schematic diagrams for explaining the block matching process operation of the block matching 
unit and the supply and selection of a reference block image data in the first embodiment of the present invention; 
25 Fig. 10 are schematic diagrams for explaining the supply and selection of a reference block image data in the first 

embodiment of the present invention; 

Fig. 1 1 is a block diagram for explaining a block matching circuit in a second embodiment of the present invention: 
Fig. 12 is a block diagram for explaining a block matching circuit in a third embodiment of the present invention; 
Fig. 1 3 is a diagram for explaining the supply and selection of a reference block image data in the third embodiment 
30 of the present invention shown in Fig. 12; 

Fig. 14 is a diagram illustrating the structure of the motion vector monitoring unit in a fourth embodiment of the 
present invention: 

Fig. 15 is a diagram to explain the operation of the motion vector monitoring unit in the fourth embodiment shown 
in Fig. 14; 

35 Fig. 16 is a block diagram illustrating the internal structure of a processing element of the block matching circuit 

in the fourth embodiment; and 

Fig. 17 is a block diagram illustrating the structure of the motion vector estimating apparatus according to a fifth 
embodiment of the present invention. 

40 D scription of the Preferred Embodiments 

A motion vector estimating apparatus of the present invention will be described below in detail with reference to 
the accompanying drawings. 

Fig. 1 is a block diagram illustrating the structure of the motion vector estimating apparatus according to the first 
45 embodiment of the present invention. Referring now to Fig. 1 , the motion vector estimating apparatus in the first em- 
bodiment is composed of a current picture data storage unit 1 , a reference picture data storage unit 2, a block matching 
unit 3, a motion vector monitoring unit 4, and a motion vector storage unit 5. 

The current picture data storage unit 1 stores image data of a current picture and pixel data C for pixels of a current 
block are read out from the storage unit 1 onto a signal line 101 by the block matching unit 3. The reference picture 
so data storage unit 2 stores image data of a reference picture and pixel data for pixels of a reference block within a 
predetermined search window are read out from the storage unit 2 onto a signal line 121 by the block matching unit 
3. The reference picture may be a previous picture to the current picture, or a different picture useful for estimating the 
motion vector. 

The block matching unit 3 reads out the current block pixel data from the current picture data storage unit 1 and 
55 the reference block picture data from the reference picture data storage unit 2 for a block matching process. Then, the 
block matching unit 3 performs an operation for the block matching process indicated by the following equation (1 ) to 
the current block pixel data and the reference block pixel data. The block matching process is performed all the refer- 
ence blocks of the search window. 
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D( *' m) = ^l /? ^ +/ - , '"' + J ''- 1 )C(/,y)| equation (1) 

where is one pixel data of the current block. ^ - is one pixel data of the current block, and M and N are the number 
ofcoiurnns and the number of rows in the current block, respective.y. The reference block having the Ix mumTkeH 
hood, ..... a minimum value as a result of the block matching process operation is determined fa! estimating a moTon 

, t nrI^T'T VeC, °, r S ' 0raSe UnU 5 St ° reS P fevious| y estimated motion vectors. The motion vector monitoring unit 4 
stores the motion vector ou.pu.ted from the block matching unit 3 in the motion vector storage unit 5 Also the mo"ion 

vec o ,T rrr" 4 ° UtPU,S 3 S6arCh Wind ° W deSi9nati ° n da,a «° the block matchi "9 ""it 3 based on he mot on 
" ^ T 39 : ^ °' her W ° rdS ' * he m ° ,i0n Vec, ° r monitorin 9 unit 4 determines estimation S>ry 
of the mot on vectors based on the motion vectors stored in the motion vector storage unit 5 The motion vector mon 

ThansThe moTontT P ° Si,i ° n °' ,he SearCh based on ' he ^nl^Z^Z^ 

That is the motion vector monitoring unit 4 refers to the motion vector storage unit 5 to determine whether the number 

of piously es.,mated motion vectors exceeds a predetermined number. When it is determined Ttha the numbTr o 
previously es..mate dmo .ion vectors is less than the predetermined number, the motion vector mon^^^^^ 
the previously 'estimated motion vectors and determines optimal size and shape of the search window based on the 
number 5e r^v" ' S dele ™ ed " a * *• "umber of previously estimated moL vectors exceeds the predettmined 
number, the motion vector monitoring urn. 4 produces a 2-dimensiona. map in which each of the previously estimated 

de^e ZZT " 7 aPPed *T? ° n US h ° riZOn,al and thS V6r,iCal Va,U6S Thus ' the "»tion vecfa^iL^M 
determines optimal size and shape of the search window such that the search window includes the historv of he 
previously est.mated motion vectors. Also, the motion vector monitoring unit 4 calculates average v^ues ol changes 
of the piously estimated motion vectors in the horizontal and vertical directions, respective* and determines an 
optimal .po.rt.on o the search window such that, for example, a center position of the search window is ^shtZ based 

the slTZlV thlS T SiZ8 ' Shape and POSi,i ° n ™* be ^ependently d.t.rrrUnSSnt.^S? 

the size, shape and position may be collectively determined wu^ry, 

Thus 7 th^e°shaoL° a nTlT 9 Tl* ° U,P ? ^ desi 9 na,ion data 161 to the b.ock matching unit 3. 

data tTZ^Z T P m °l f 6arCh W ' ndOW bS Chan9ed in r6Sponse to the search window designation 
data for every frame, for every block, or ,f necessary. Therefore, the search window determining process is not a bottle 
neck to restrict the processing speed of the block matching process. P 

In this manner, the shape, size and position of the search window is determined based on the estimation historv 
of he pre™, motion vectors. Then, the block matching process is carried ou, between me"u^Tb™SnS^S 
of the reference blocks within the search window to detect a motion vector. 

Also, in the present invention, the size, shape and position of the search window may be adaptively determined 
Therefore, the size, shape or position of the search window is changed determined. 

Fig. 2 is a block diagram illustrating the structure of the block matching unit 3 according to the first embodiment 
of the present invention. Referring to Fig. 2, the block matching unit 3 is composed of a conXrVc^ll^^l 

^^7^^!^^^ T 3 ^ ' 0rmed ° n 3 s - ic ° nd -'-hip. However, only the block ma.c^g 

circuit may be formed on the semiconductor chip. 

^ control circuit 10-1 receives the search window designation data 161 from the motion vector monitoring unit 
4 o read the current block pixel data from the current picture data storage unit ^ and the pixel date ToTeZ of t Z 
reference blocks within the search window P ach of ,he 

N roZT^VToliZf^ * composed of a plurality of processing elements 21 arranged in M columns and 
N rows (M and N are positive integers, respectively, and M = 4 and N = 4 in this example) The processina elements 
2 are cascade^onnec.ed in units of rows and the rows are cascade^onnected. The L. processing e eLm PE " 

S^^VoTTn?. C ° n rf 1° 3 min, ' mUm V3,Ue de,8C,in9 CirCUi ' 25 - The minimUm ->ue deteTng S 
l,nHnZ J ° , . 1 Ce b ' OCkS haV ' ng lhS minimum va,ue in ,he block matching process within the search 

window and outputs the minimum value reference block as a motion vector. 

uJ { ,S n °' rec J uir f that ,he number of Processing elements is equal to the number of pixels of the current block 
However „ the total number of processing elements PE MN is made equal to the number of pixels of the current bkSs 
FIFO un,ts which are requ.red in the conventional system can be omitted. As a consequence the chip area of me oT«k 
matching unit 3 can be reduced, and the power consumption can be reduced 

intea^more rn^Tf r? ^ m 5" 1 ? C ° mP ° Sed °' (P + Q) erence block P ixel da,a buses < P ^isa positive 
NoTZmJ,* h ^ l 9 Z UP ° f M SeleCt ° rS ' and 3 S6COnd 9r ° up of N selec,ors ln tnis sample, the reference 
block pixel data buses are the Rv1 . Rv2, Rh1 and Rh2 reference block pixel data buses 301 302 303 and 304 inThis 
example. Sequences of pixel data for the reference blocks are supplied to the Rvl , Rv2, Rh1 and R^2 refere^e bS 
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pixel data buses 301 302, 303 and 304 from Ihe reference picture data storage unit 2. Selectors 2211 to 2214 are 
provided as the first group of M selectors and are connected to the Rv1 and Rv2 reference block pixel data buses 301 
and 302 Also selectors 2221 to 2224 are provided for the second group of N selectors and are connected to the RM 
and Rh2 reference block pixel data buses 303 and 304. The selectors 2211 to 2214 and 2221 to 2224 are controlled 
5 in response to control signals from the control circuit 10-1 , respectively. 

The block matching circuit 300-1 is further composed of Rh reference data buses 206-1 to 206-4, Rv reference 
data buses 207-1 to 207-4 and current data buses 208-1 to 208-4. Each of the Rh reference data buses 206-1 to 206-4 
is provided for one of the selectors 2221 to 2224, and each of the Rv reference data buses 207-1 to 207-4 is provided 
for one of the selectors 2211 to 2214. Each of the reference data buses 206-1 to 206-4 and 207-1 to 207-4 transfers 
w one reference block pixel data selected by the corresponding selector. Each of the current data buses 208-1 to 208-4 
are provided for one of the rows of processing elements 21 in this example, and the current data buses 208-1 to 208-4 
are connected to each other. Each of the plurality of processing elements is also connected to a corresponding one of 
the Rh reference data buses 206-1 to 206-4, a corresponding one of the Rv reference data buses 207-1 to 207-4, and 
a corresponding one of the current data buses 208-1 to 208-4. It should be noted that although the current data buses 
is 208-1 to 208-4 are commonly connected in the structural example shown in Fig. 2, the current data buses 208-1 to 
208-4 are not always commonly connected. 

The block matching circuit 300-1 is further composed of a circuit 11. The circuit 11 switches the pixel data read 
from the reference picture data storage unit 2 between the reference block pixel data buses 301 to 304. The circuit 11 
may includes shift registers, each of which latches one row of reference block pixel data and outputs the pixel data of 
20 the row one by one. Thus, the read operation from the reference picture data storage unit 2 can be simplified. However, 
the reference block pixel data buses 301 to 304 may be directly connected to the reference picture data storage unit 
2 In this case, the circuit 11 can be omitted and the read operation is controlled by the control circuit 10-1 . 

Fig 3 is a block diagram illustrating the structure of each of the plurality of processing elements 21 and the pe- 
ripheral portion in the first embodiment. Referring to Fig. 3, the processing element (PE) 21 is connected to the output 
25 port Do of a previous stage of processing element 21 via a signal line 203 and the input port Di of a next stage of 
processing element via a signal line 204. The processing element is connected to the corresponding current data bus 
208 at a terminal C, and to the corresponding Rh and Rv reference data buses 206 and 207 through a selector 23, 
which is not shown in Fig 3. The selector 23 is controlled in response to a control signal from the control circuit 10-1 
One of reference pixel data on the corresponding Rh reference data bus 206 and reference pixel data on the corre- 
30 sponding Rv reference data bus 207 is selected by the selector 23. Then, the processing element 21 performs a 
calculation in response to a control signal from the control circuit 10-1 using the current pixel data received from the 
corresponding current data bus 208 and the selected reference pixel data. In the calculation, an absolute value of a 
difference between the current pixel data and the selected reference pixel data is calculated. In response to a control 
signal from the control circuit 10-1, the calculation result is added to the data received at the terminal Di from the 
35 previous stage of processing element. The adding result is outputted from the terminal Do for the next stage of process- 
ing element such that pipeline processing is allowed. The calculation and the addition are performed synchronously 
between the processing elements in response to the control signals or clock signals from the control circuit 10-1 Thus, 
the block matching process can be performed in a pipeline manner. 

Fig 4 is a block diagram illustrating the structure of the processing element as a minimum unit for constituting the 
40 block matching circuit 300 shown in Fig. 2. Referring to Fig. 4, the processing element 21 is composed of registers 
11a 11b and 11c a calculating element 12 and an adding element 13. The registers 11a, 11b and 11c are pipeline 
registers The calculating element 12 calculates an absolute value (T = I IR - CI I ) 205 of a difference between the 
pixel data CI of the current block which is supplied from the current data bus 208 via a signal line 202 and the pixel 
data IR of the reference block which is supplied from the selector 23 via a signal line 201 . The adding element 1 3 adds 
45 the data Di received at the Di port via the signal line 203 and the absolute value T 205, namely, (Di + T), and then 
outputs the summation from the Do port onto a signal line 204. In this manner, the pipeline process.ng is performed. 

Fig 5 is a block diagram illustrating the motion vector monitoring unit 4. Referring to Fig. 5, the motion vector 
monitoring unit 4 is composed of a filter 1 201 , a search window determining circuit 1 202, and a window data storage 
unit 1203 The filter 1201 filters the motion vector estimated by the block matching unit 3. The filtered motion vector is 
so stored in the motion vector storage unit 5. The search window determining circuit 1202 refers to the motion vector 
storage unit 5 to determine a search window from the estimation history of previously estimated motion vectors stored 
in the motion vector detecting unit 5. Also, the search window determining circuit 1202 stores a data designating the 
determined search window in the window data storage unit 1 203. It should be noted that the filter 1 201 may be replaced 
by another filter, or may be omitted. . 
55 The above-explained search window designating data is composed of, for instance, an upper left position, height 

and width of each of rectangular reference regions of the determined search window, which will be described below. 

An example of the search window used in the present invention will be described below with reference to Figs. 6A 
and 6B Fig 6A is a diagram schematically illustrating the search window and Fig. 6B is a partial expanded diagram 
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of he search w.ndow shown .n Fig. 6A. Referring to Fig. 6A, the search window H00 is composed of a plurality of 
rectangular reference reg-ons 110!, 1102, 1103, and 1104, which overlap one after another Tha. is the efe ence 
reg.ons 1101, 1102, 1103. and 1104 are synthesized to form the search window 1100. The search window „ 00 n^ 
s P 'Z e l™ en 7 ' S n °! neCeSSari, y re ^ ired to *™ a ^.angular shape, unlike the conventional system ,n the block 
5 niching un,. 3 according to the f.rst embodiment of the present invention, even when the search window is no. ~ 

s^r^sr^ and/or a convex re9ion ' this search window may - — - - ~ 

.h ar 2 n Th 6 °' her T d ' ! COnvenlional search wind ™ 1 " 0 is illustrated by a dot line in Fig. 6A and has a rectangular 
w TuTZl S6arCh W,nd ° W 1100 iS diff6rent fr ° m tha * °' the — entiona. example in regions 11,1, in 2 n^3 atd 
b ik mih 9 '° nS de,e / mined bV S6afCh Wind ° W dete ™™9 circuit 1 202 to be unnecessary to perform the 

J " "l3 and nT/S ! T T T*™' ^ **** PrOC6SS iS n °« P erf °™ ed «° < h * regions" 1 

1112 _ 1113 and 1114. Therefore, the block matching process can be completed at high speed. Even when the block 

tafn d "a? SHhTs'haoe TT' ^ 8PMd °' ^ P^-sing elements 21 can b S 

« conslptioa * "'"^ 9iVe adV6rSe inflU6nCe t0 the P rocess ,ime and the power 

, ho STJ"" 9 , I** t0 Fi ?-- 1, m ° ti0n VSCt0r moni,orin 9 ""it 4 monitors and stores the motion vectors obtained from 
^ 2e ani/n r 9 ""'I I" ^ ^ ^ unB 5 ' The m ° ,ion vec,or «"» « changes fhe shape 

TlTf? P S "r °' ' he f SeafCh Wind ° W baS6d ° n the mo,ion vec(ors stored « «he motion vector storage unit 5 i' 

^tTSiTS7h P T iOUS ' y eS ' imated m0,i ° n VeC, ° rS BeC3USe ,h6 blOCK ma,Chin 9 Unit 3 according to 
he hrst embodiment of he present mvent.on ,s applied to the search window having any concave regions and convex 

reg^ns, the efficiency of the block matching process is no. decreased. As a consequence, since the estimation riZ 
of .he previously estimated motion vector is more effectively reflected, there is no need to earn/ out the blo^k ma.S 
process .o, he waste reference regions such as the regions 1111, 1112, 1113 and 1114. Thu..le^l^St^S 
time and the resultant power consumption can be reduced. resuiiani processing 

Also, even if the size of the search window is changed based on the estimation history of the motion vectors the 
number of reference regions is merely increased and/or decreased. There is no change in the rule o le sequences 
Ihnd 3 * PPtod T b '° Ck ma,Chin9 Unit 3 ' 38 d6SCribed later As a consequence, in acco^^TS 
unt 3 comX S ' Ze ' 6 8nd/0r P ° Si,i0n °' S6arCh Wind ° W ^ Chan9ed W ^° Ul makin9 ,he ^^k matching 
*> In the present invention, the size, shape, and/or the position of the search window is adaptively changed based 

which e aT ,0 r 7 V6Ct0rS AS 3 r6SU "' b ' OCk matChin9 Pr ° CeSS iS not P erformed ^ the regLs 111 1 to in 4 
which are not ong.na.ly required to perform the block matching process. Therefore, the processing time and amount 
can be reduced. As a consequence, the power consumption required for the block matching process can be recced 
ss rmpXe/regioTs ^ ^ ^ the b '° Ck Aching process operation .o the 

embodiment 6 ? t r t[ ° n °' b,OCk matChin9 Pr ° C6SS in the m ° ,i0n VeCt ° r estimatin 9 a PP aratus according to the first 
embodiment of the present invention will be described below in detail. In this example, it is assumed tha. the curren 
btock ,s composed of 16 pixels of 4 x 4 (C,, to C 44 ). The search window is shown in Fig. 6A The reference ZTon 
window S6arCh T d ° W iS C ° mP0Sed °' 49 piX6lS ° f 7 X 7 (R - to R ? and the '^ence region noToHhe sea ch 

^nVeZr :i ir^rj. v 15 A,so ' " ,s assumed that ,he biock — *> — ™ 

.he cu 9 ent S blork S re ' ati ° n ^ ^ PreVi ° US ' y eS,ima,6d m0,i ° n VeC '° rS ' the S6arch window ' the reference block a "d 

unit Sand StlminT^lT" ma,C h hin . 9 P r0C ^ SS ' the motion ve c.or monitoring unit 4 refers to the motion vector storage 
ZLlTnT , I W ' nd0W baSSd ° n th8 m ° ,i0n VeCt ° rS Previously estimated, .n this case, the motion 

vec or mon tonng urn. 4 determines from the previously estimated motion vectors that there is no case where a new 

Fia GA ThT fn , ° U ' " r6Qi0nS 1 1 1 1 ' 1 1 1 2. 1 1 1 3 and 1 1 1 4, and determines the search window, as'how^ 
each o, 06 re9 T 1 1 01 t0 1 1 04 COns,itutin 9 the se arch window are determined and data designing 

each of the reference regions 1101 to 1104 is stored in the window data storage unit 1203. Also the data desiqna no 

lTZ° e Tin S6arCh windowdesi 9 na «- da ^ is supplied to theL.ro. circuit 10-1 o, ietoSSSS 

unrt 3. The control arcu,. 1 0-! determines pixel data to be read out from the current picture data storage unit "and I the 
reference p,c ure data storage unit 2. The pixel data read out from the curren, picture data storagfunTl and the 

hfntr T S '° ra9e Uni ' 2 SUPP ' ied '° lhS b ' OCk ma,Chin 9 circuit 300 " 1 of block matching uni 3 Thus 
the block matching process is started. y nus " 

Fios F 8A a 8 nHflR d f Scherna,icallv i»«8trating the pipeline processing for the block matching process In 

Figs. 8A and SB^an abscissa denotes t,me. Symbols C,., to C 4 . 4 represent pixel data of the curren. block Symbols 
1 ,0 R 7.7 represent p.xel data of the reference region 1101, and symbols S, , to S 44 and R, , ,o R 77 and so on 
represent pixel data of the reference region 1102 as shown in Fig. 68. Furthermore'each of symbol 7o™ <c ^d" 
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represents data which indicates a likelihood of the corresponding reference block and which is supplied to the minimum 
value detecting circuit 25. The likelihood data is calculated based on the following equation (2): 



5 D(k,m)-^^\R(k+i-lm + j-l)-C(iJ$ equation (2) 

In this formula, symbol "k" is selected to be 1 to 4, and symbol "m" is selected to be 1 to 4. 

10 As apparent from Figs. 8A and 8B, four different sequences of pixel data 81 to 84 supplied to the processing 

elements are shown by symbols O.x.G and A. As illustrated in Figs. 8A and 8B, the sequences of pixel data 81 to 84 
are sequentially supplied to the block matching circuit 300-1 . That is, at a time 1 , a pixel data C, of the current block 
is supplied to the processing element PE^ ^ via the current data bus 208-1. At the same time, a pixel data R^ of the 
reference block of the reference region 1101 (Fig. 6B) is supplied to the processing element PE, A from the sequence 

is of pixel data 81 In the processing element PE, v these pixel data are latched by the registers 11a and 11b and an 
absolute value of a difference between these pixel data, i.e., I R 21 - C u I is calculated by the calculating element 12. 
Because the processing element PE t , is a head element of the cascade connection so that the data Di of "0" is supplied 
from the register 11c, the processing element PE 1 outputs the absolute value of the difference from the output port 
Do to the input port Di of the next stage of processing element P.E 2<1 at a time 2. 

20 At the time 2, a pixel data C 2 A of the current block is supplied to the processing element PE 2A from via the current 

data bus 208-1 . The pixel data C 2 , of the current block is supplied to the processing element PE 2<1 - At the same time, 
a pixel data R 2J of the reference block is supplied to the processing elements PE^ and PE 2i1 from the sequence of 
pixel data 81. 

In the processing element PE^ j, since the pixel data C 1 n has been already latched by the register 11a, the pixel 
25 data R 2 t is latched by the register 11 b and then an absolute value of a difference between these pixel data R 2 -, and 
C A t, i.e., l R 2 1 - C 1 I is calculated by the calculating element 1 2. The processing element PE lt1 outputs the absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 2t1 at a 
time 3 Also, in the processing element PE 21 , the pixel data C 2A and R 2<1 are latched by the registers 11a and 11b 
and an absolute value of a difference between these pixel data, I R 2 A - C 2A I is calculated by the calculating element 
30 12. The adding element 13 adds the absolute value of the difference to the calculation result I R^-, - C i n I supplied 
from the processing element PE A ^ via the input port Di. The processing element PE 21 outputs the summation absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 3-1 at a 
time 3. 

After a time 4, the processing elements PE, v PE 2 PE 3 , and PE 41 are in the following holding states. That is, 
35 in the processing element PE 4 v a data of I R 1 n -C, , I + I R 2 , - C 2A I + I R 3<1 - C 3>1 I + I R 4<1 - C 4 1 I is stored. Similarly, 
in the processing element PE 3 v a data of I R 2 , - C, A I + I R 3>1 - C 2A I + I R 4i1 - C 3>1 I is stored. In the processing 
element PE 2J , a data of I R 31 - C 1 ^ I + I R 4>1 - C 2>1 I is stored. In the processing element PE 1/h a data of I R 41 - C 1 -i 
I is stored. 

At a time 5, a pixel data 2 of the current block is supplied to the processing element PE 1 2 from via the current 
40 data bus 208. The pixel data 2 of the current block is latched in the processing element PE, 2 . At the same time, a 
pixel data 2 of the reference block is supplied to the processing elements PE 1(1 and PE 1-2 from the sequence of 
pixel data 82* Also, a pixel data R 5J of the reference block is supplied to the processing elements PE 2V PE 3<1 and 
PE 4 1 from the sequence of pixel data 81 . 

in the processing element PE lt1 , since the pixel data has been already latched by the register 11a, the pixel 
45 data R 1 2 is latched by the register 11b and then an absolute value of a difference between these pixel data R 2<1 and 
C-j 1 , i.e', I R-( 2 - C-| 1 I is calculated by the calculating element 12. The processing element PE^ outputs the absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 2A at a 
time 6. 

Also, in the processing element PE 21 , since the pixel data C 2 has been latched by the. register 11a, the pixel 
so data R 5 i is latched by the register 11b and an absolute value of a difference between these pixel data, I R 5<1 - C 2A I 
is calculated by the calculating element 12. The adding element 13 adds the absolute value of the difference to the 
calculation result supplied from the processing element PE^ via the input port Di. The processing element PE 2A 
outputs the summation absolute value of the difference from the output port Do to the input port Di of the next stage 
of processing element PE 3 ^ at the time 6. 
55 Also in the processing element PE, 2 , since the pixel data C, 2 and R 12 are latched by the registers 11a and 11b 

and an absolute value of a difference between these pixel data, I z - C, 2 I is calculated by the calculating element 
1 2. The adding element 1 3 adds the absolute value of the difference to the calculation result supplied from the process- 
ing element PE 41 via the input port Di. The processing element PE 12 outputs the summation absolute value of the 
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difference from the output port Do to the input port Di of the next stage of processing element PE 2 2 at the time 6 
The operation described above is repeated from the time 6 to a time 16 

m« At a ,rrJi lh !. ,ik rf ,ih00d Dl - 1 ° f reference b,ock havin 9 the P*el *ta of R, , to R 44 is obtained. In this 
manner, the likelihoods D Kt to D 4 . 4 for the reference blocks of the reference region 1101 can be sequentially obtained 
in a time interval from the time 1 7 to time 32 in a pipeline manner. 

At the same time, at the time 17, a pixel data S M of the next reference region 1102 (Fig. 6B) is supplied to the 
processing element PE M from the sequence of pixel data 83 with no waste rest. The pixel data S, , is used together 
with the P-xe data C„ to calculate an absolute value of a difference between the pixel data s, . and C, , In this 

r^nlTi m Z matCh f 9 P fOCesS to the next reference ^on 1 1 02 is started in a continuous state from the reference 
region 1101 with no wasted processing time. 

In this manner, 16 pixel data of the current block in maximum are used at the same time. However because the 
same current block pixel data are used in the same processing elements, the number of pixel data of the current block 
to be supplied at the time can be reduced to "1 " if the pixel data already supplied is latched. In other words in the block 
matching process shown in Figs. 8A and 8B. since the block matching process is carried out to the same current block 
the entry of the pixel data C,., to C 4 4 of the current block to the processing elements PE, , to PE 4 , is complete from 
the time 1 to the time 16. A.so, the number of pixel data to be entered at the same time is se.ected to be T 

On the other hand, the pixel data of the reference region are simultaneously supplied to 1 , 2, 3 or 4 processina 
elements. The sequence of pixel data 81 and the sequence of pixel data 82 are the pixel data present within the same 
reference region 1101 shown in Figs. 6A and 6B. Also, the sequence of pixel data 83 and the sequence of pixel data 
84 are the pixel data present within the same reference region 1 1 02. There is no problem even when these two reference 
regions 1101 and 1102 are overlapped with each other. reierence 

Ac ™ rdi " 9 10 ,h€ * fi : st embodiment of the present invention, the size, shape, and/or position of the search window 
8 flexibly determ.ned. Therefore, the bus wiring lines of the block matching circuit 300-1 are formed to be adaptive for 
the search wrnttow which is divided into a plurality of reference regions. As a consequence, even when the block 
matching process is performed in the parallel manner, this parallel degree is not decreased. The block matching process 
can be performed w.th.n a short time, and the power consumption can be reduced. Also, according to the first embod- 
iment of the present invention, even when the search window is made small, the motion vector can be detected in-high 
precision. 

Next, the supply of the pixel data of the reference region will be described below. Figs. 9A shows the cascade 
ZnS.'^rJ ^P^f 551 '" 9 6lementS PE M to PE 4.4. as shown in Fig. 2. Figs. 9B to 9F show the manner in which 
the pixel data of the reference regions are supplied at the times 1 , 2, 5, 9 and 1 7 of Figs 8A and 8B 

Referring to Fig. 9B, the pixel data R, , of the sequence of pixel data 81 is supplied on, for example the Rv1 
reference block , pixel data bus 301 . At this time, the selector 221 1 is controlled in response to a control signal from the 
control circuit 1 0-1 to select the pixel data on the Rvl reference block pixel data bus 301 such that the selected pixel 
data - transferred onto the reference data bus 207-1 . Also, the selector 23 corresponding to the processing element 
, 1 c ° ntro " ed ,n response to a control signal from the control circuit 10-1 to select the pixel data on not the Rh 

^ZITpe™ ^ ^ re ' erenC6 ** *" ^ ^ ^ PiX8 ' ^ 030 be SUpplied to the P rocess " 

Next, as shown in Fig. 9C, at the time 2, the pixel data R 21 is supplied in the same manner as the pixel data R, , 
The selectors 23 corresponding to the processing elements PE,, and PE 2 , are controlled in response to control 
s.gnals from the control crcu.t 1 0-1 to select the pixel data on not the Rh reference data bus 206-1 but the Rv reference 
data buses 207-1 and 207-2. Thus, the pixel data R,, can be supplied to the processing elements PE, , and PE 9 

Next, as shown in FigJiD. at the time 5, the pixel data R 5 ., of the sequence of the pixel data 81 is supplied on the 
Rv reference block pixel data bus 301 in the same manner as the pixel data R, ,. Also, the pixel date R, 2 of the 

ITctT^ 75 82 15 SUPP ' ied ° n ' f ° f eXamP ' 6 ' th8 RV feferenCe block P ixel da,a bus 302 ^ this time, the 

Z n , controlled ,n response to a control signal from the control circuit 10-1 to select the pixel data R, , on 
the Rv reference block p.xel data bus 301 such that the selected pixel data is transferred onto the reference data bus 
T'l; ^ S ?i ^ 23 , COrres P° ndin 9 t0 the Pressing elements PE 2 ., to PE 4 , are controlled in response to control 
signals from the control circuit 10-1 to select the pixel data R 5 , on not the Rh reference data bus 206-1 but the Rv 
reference data buses 207-2 to 207-4. Thus, the pixel data R 5 , can be supplied to the processing elements PE 2 , to 
rnn,L l , corres P° ndin 9 '° processing elements PE,., to PE, 2 are controlled in response to 

206 ? >IZ « , C °T l drCUit 1 °" 1 ,0 Se ' eCt * he PiXe ' data R '-2 ° n not the Rh reference buses 206-1 and 
206-2 but the Rv reference data bus 207-1 . Thus, the pixel data R, 2 can be supplied to the processing elements PE, , 

cinn!^?' 88 * hown 10 Fig ' 9E ' at the time 9 - ,he se,ec, °rs 2211 , 2221 and 2222 are controlled in response to control 
signals from the control c.rcuits 10-1 such that the pixel data R, 3 of the sequence of the pixel data 81 is supplied to 

TlZZ hTk on"/ 207 ' 1 th8 PiXe ' d3,a Rs 2 °' thS Sequence of ,he P ixel data 8 2 is supplied to the Rh 
reference data buses 206-1 and 206-2. Then, the selectors 23 corresponding to the PE, , to PE, 3 are controlled by 
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the control circuit 1 0-1 such that the pixel data R, , 3 is supplied to the processing elements PE , t , , PE , 2 and PE 13 , 
the pixel data R 5 2 is supplied to the processing elements PE 2 t to PE 4J and PE 2 2 to PE 4 2 . 

Next as shown in Fig. 9R at the time 17, the sequence of pixel data 81, the sequence of pixel data 83 and the 
sequence of pixel data 82 are supplied to the Rh1 reference block pixel data bus 303, the Rh2 reference block pixel 
5 data bus 304, and the Rv1 reference block pixel data bus 301 , respectively. The selectors 2211 to 221 4 are controlled 
' in response to control signals from the control circuit 10-1 such that the pixel data R 5 4 is transferred to the Rv reference 
data buses 207-1 to 207-4. The selectors 2221 to 2224 are controlled in response to control signals from the control 
circuit 10-1 such that the pixel data S A , is transferred to the reference data bus 206-1 and the pixel data R 1-5 is 
transferred to the reference data buses 206-2 to 206-4. Also, the selectors 23 corresponding to the processing elements 
io PE 2 -j to PE 4 ! , PE 2 2 to PE 4 2 , PE 2 3 to PE 4<3 and PE 2 4 to PE 4i4 are controlled by the control circuit 1 0-1 such that the 
pixel data R 5 4 is supplied to the processing elements PE 2 ^ to PE 41 , PE 2 2 to PE 4 2 , PE 2 3 to PE 4 3 and PE 2 4 to PE 4 4 . 
The selectors 23 corresponding to the processing element PE,., is controlled by the control circuit 10-1 such that the 
pixel data S, , is supplied to the processing element PE^ and the selectors 23 corresponding to the processing 
elements PE| 2 to PE! 4 are controlled by the control circuit 1 0-1 such that the pixel data ^ 5 is supplied to the process- 
es ing elements PE^ 2 to PE^ 4 . 

Fig. 10 is a diagram illustrating a method for supplying the pixel data to the processing elements when four kinds 
of pixel data are supplied to the processing elements at a time. As shown in Fig. 10, the processing elements are 
divided into four regions 1121, 1122, 11 23 and 11 24 by boundary lines 1131, 1132 and 1133. 

The example shown in Fig. 10 corresponds to the state of the time 22 in Figs. 8A and 8B. In this case, the supply 
20 of the pixel data from 4 reference block pixel data buses 301 to 304 to 4 regions 11 21 to 11 24 in Fig. 10 are controlled 
by moving the boundary lines 1131 to 1133. That is, the selectors 2211 and 2212 are controlled by the control circuit 
10-1 such that the sequence of pixel data 84 can be supplied to the Rv reference data buses 207-1 and 207-2. The 
selectors 2213 and 2214 are controlled by the control circuit 10-1 such that the sequence of pixel data 81 can be 
supplied to the Rv reference data buses 207-3 and 207-4. The selectors 2221 and 2222 are controlled by the control 
25 circuit 10-1 such that the sequence of pixel data 83 can be supplied to the Rh reference data buses 206-1 and 206-2. 
The selectors 2223 and 2224 are controlled by the control circuit 10-1 such that the sequence of pixel data 82 can be 
supplied to the Rh reference data buses 206-3 and 206-4. If the selectors 23 corresponding to the processing elements 
PEt a to PE 44 are controlled, the four kinds of sequences of pixel data can be supplied to the processing elements 
PE,\ to PE 4 4 as shown in Fig. 10. That is, in the block matching unit 3 according to the first embodiment of the present 
30 invention, the four kinds of pixel data can be supplied to the processing elements at the same time by properly moving 
boundary lines 1131 to 1133. 

In this manner, since 4 kinds of pixel data over the reference regions in maximum and also one kind of pixel data 
of the current block can be supplied, the operation rate of the processing elements can be increased to 100%. 

Also, even when the block matching process is carried out to a plurality of reference regions, the block matching 
35 process can be successively executed while changing the reference regions, in the state in which the operation rate 
of the processing elements are maintained at 100%. As a result, in this embodiment, the time period required for 
executing the block matching process can be shortened. When it is supposed that the time period for executing the 
block matching process is a predetermined constant value, the operation frequency of a clock signal for the block 
matching unit 3 can be reduced, as compared with that of the conventional system. As a consequence, the power 
40 consumption ot the block matching circuit can be reduced. 

Also, since the time period for performing the necessary block matching process can be shortened, the block 
matching process can be carried out in the wider reference region. 

According to the present invention, the pixel data of the reference region is supplied via a hierarchical selector 
structure, i.e., the two stages of selectors to the bus wiring line connected to the processing elements as the minimum 
45 units of the block matching circuit 300-1 . As a consequence, even when the block matching process is carried out to 
the plurality of reference regions 1101 to 1104, it is possible to execute the block matching process in such a manner 
that the pipelines of the block matching circuit 300-1 are continuously filled without a wasted processing time. That is, 
the parallel degree of the block matching process is not decreased. As a consequence, even when the search window 
is divided into the plurality of reference regions, the processing time of the block matching process is not increased 
50 and the power consumption is not increased. 

Also the local wiring lines 203 and 204 in Fig. 4 are employed in the wiring line system used to connect the 
processing elements as the minimum units of the block matching circuit 300-1. Therefore, when the minimum value 
or likelihood is calculated from the summation of the difference absolute values, the total number of the wiring lines 
from all the processing elements to the minimum value detecting circuit 25 can be suppressed. As a consequence, 
ss the motion vector estimating apparatus can be realized with the smaller chip area and with power consumption reduced. 

Next, the motion vector estimating apparatus according to the second embodiment of the present invention will 
be described below. Fig. 1 1 shows the structure of the block matching unit 3 of the motion vector estimating apparatus 
in the second embodiment. Referring to Fig. 11 , the block matching unit 3 is composed of a control circuit 10-2 like the 
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control circuit 10-1 in the first embodiment, and a block matching circuit 300-1 

M „ Jno b t ,0Ck ma ' Ch ! n9 C ' rCUi ' ™' 2 inC ' UdeS P rocessin 9 elements PE, to PE M N of M columns and N rows (M and 
In ilT TJ, 9Ven , "1 69erS ' reSpeC,iVel ^- each °' which ■» 'he same as the processing element in the firs, embodlem 
n^L ! • 13 4 N ,S 8 ThS CUrren ' b ' OCk iS C ° mpOSed 0f < M x N > P ixels ' 4 * 8 in this exam^e The 
Sn^S, e ST- H 8,B CaSCa f e - Connec,ed via the output por, and the input port in units of rows. The firs, row is 

is Connie h TT\ 3 f IF ° Uni ' 24-1 " C88Cade COnneC,i ° n manner A minimum value ^tecting circuit 28 -1 

s connected to .he las. stage of processing element PE 4 . 3 via a FIFO uni. 24-2. The second row is connected to the 

"he lasTsTaoe 7° " C ° nneC,i ° n A min '' mUm Va,Ue de,eclinQ circui ' « connected 

1 a fTo n* 26 r^r'H ^ ^ 3 RF ° UPit 25 ' 2 - A ' SO ' ,he Mf,h fOW iS COnnec,ed to the "ven* row 

Z L F , , f connection manner. A minimum value detecting circuit 28-3 is directly connected to 

the las. stage o, processmg element PE 47 . The sixth row is connected to the eighth row via a FIFO uni, 27-1 in cascade 
connection manner. A mi n,mum value detecting circuit 28-4 is directly connected to the las. stage o, processing elment 

circu'" ^Tfnnut iST '? ^ V ' o^* pfOCeSSm * elemen,s of the odd ^ " an upper half of block ma.ching 
circuit 300-2 the input and output ports 203 and 204 are cascade^onnected via the FIFO units 24-1 and 24-2 In the 

SS^d'Si T°T °T 6Ven r ° WS ' n UPPef ha " °' ,he b,OCk matChin 9 Circuit ^ tne inP"t and output ports 
203 and 204 are cascade-connected v,a the FIFO uni.s 25-1 and 25-2. In the processing elements of the odd rows in 

.he To ^TAn T ma,Ch,n9 Ci 7" 30 °- 2 ' lnpU ' and ° U,PUt P ° rtS 203 and 204 are cascade<onne "ed vL 
h! v, , 1 , 6 pr0cess,n 9 elements of ,he even rows in .he lower half of ,he block matching circuit 300-2 

the input and outpu, ports 203 and 204 are cascade-connected via the FIFO uni, 27-1 

as S h T o h wn St i a nng n T berS °' ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 3rS 4 ' 4 ' 20 ' 16 ' 4and 4 ' respectively. 

207 1 "to TnTT ^ b H S ! S ( 206 : 1 10 206 " 8 PfOVid6d f ° r ,he rOWS ' res P e <^e.y, and Rv reference data buses 
207-1 to 207-4 are provided for the columns, respectively. The relation of one of the processing elements and the 
^responding Rh and Rv reference data buses is the same as in the firs, embodiment. The RV reference b ock pixel 
data buses 301 and 302 are provided and one of them is connected to the Rv reference data buses 207-1 to 207 4 
via selectors 2211 to 2214. respectively. The Rh reference block pixel data buses 303 and 304 are provided and one 
of ,hem ,s connected to the Rh reference data buses 206-1 to 206-8 via selectors 2221 to 2228 respectively 

The current data buses 208-1 ,o 208-8 are provided for the rows, respectively, and are connected together It 

shown in F,g. 11. these current data buses 208-1 to 208-8 are not always commonly connected 

on. Srot! PIXel T ,B °\ fOUr Sequences on the Rv and Rh reference block pixel data buses 301 to 304 is supplied to 
one processing element va one of the selectors 2211 to 2214 and one of the selectors 2221 to 2228 and En the 
selector 23 corresponding to the processing element, as in the first embodiment 

35 hinrlT »h 9 , 1 1 ' " I?' 3 ' nUmb6r °' pr ° cessin 9 elements is set equal to the number of pixels, M x N, of the current 
i onn? 96 nUmb6r ' n 6aCh ° f ,hS RFO UnitS Can be decreased. As a result, the chip area of the block matchTno 
circuit 300-2 can be further reduced, and also the power consumption can be reduced 9 

, k A 2°;K n . F .!?" 11 ls assurr, ed that the number of stages of each of the FIFO units 24-1 , 25-1 , 26-1 27-1 is selected 

of the f'.FC fnZZsT* 1 ? ^ * «° b6 < M * N >* + M ' and tha < >he numb ro stage 

an outou^ 306 rom fh. I Vlt V™ X ^ ^ ^ * ° f ^ output 305 from the FIFO unit 24-2, 

an output 306 from the FIFO unit 25-2, an outpu. 307 from the processing element PE M N „ and an output 308 from 

bodimenroHh 9 ^^^^ ^ 6qUa ' '° ° U,PUt 310 ShOWn * ^ 2 Furthe^according to thTsecond em 
bodiment of the present mvent.on, the data entry from the reference regions is scheduled in such a manner that the 
number of stage of each of the FIFO units can be simultaneously reduced. As a consequence theXk Snatch no 
circuit can be realized such that ,he chip area can be saved and the power consumption can be reduced 9 
in Fio r ia C pm„ISlJ ^H PreSen ' ! nVe, : ,i0n iS applied to the MPEG " 2 ( |SO/| EC 13818-2), if the circuit structure shown 
satino l edicZ ! ' ^ * 3t ' he S3me ' ime lhe da,a which are " a frame motion compen- 

r r C ° mpenSat,n 9 P rediction in a structure, a field motion compensa,ing prediction 

Lmn.nf t , 9 Pred ' C, '° n °' m0,i ° n com P ensalin 9 perdition in the field structure, and ,he 1 6 x 8 motion 

compensa ,ng pred.ct.on. In addition, the processing e.ements are divided into four kinds of processing elements as 
he espedve processing elements belonging to the upper odd rows, the upper even rows, the lower odd rows and 
the lower even rows. Therefore, the motion vectors can be simultaneously acquired in response to a plurality of' pre 

connTed fL the o 2 ''^'f 13818 - 2) ' ^ ^ ^ ^ ^ the ^ lines ^ 'he wi ng lines 
connected from the processing elements, and the chip area can be reduced 

described Sow 0 F°o "Tl ^'T^ aCC ° rd ' n9 '° the * hird embodimen « °f the present invention will be 

described below Fig. 12 shows the structure of a block matching circuit 300-3 in ,he third embodiment The block 
matching circu 300-3 inc.udes processing elements o, 8 columns and 16 rows. The structural concept of the block 
matching circuit shown ,n Fig. 12 is the same as that of the block ma.ching circuit in the second embodiment Te 
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difference point is in the number of processing elements. 

Like in the second embodiment, in the processing elements of the odd rows in an upper half of block matching 
circuit 300-3 the input and output ports 203 and 204 are cascade-connected via three FIFO units. In the processing 
elements of the even rows in the upper half of the block matching circuit 300-3, the input and output ports 203 and 204 

s are cascade-connected via three FIFO units, In the processing elements of the odd rows in the lower half of the block 
matching circuit 300-3, the input and output ports 203 and 204 are cascade-connected via three FIFO units. In the 
processing elements of the even rows in the tower half of the block matching circuit 300-3, the input and output ports 
203 and 204 are cascade-connected via three Fl FO units. Each of the above-mentioned FIFO units has the stages of 8. 
The last stage of processing element of the odd rows in the upper half is connected to adders 31 and 32 via a 

w FIFO unit having the stages of 72. The last stage of processing element of the even rows in the upper half is connected 
to the adder 31 and an adder 33 via a FIFO unit having the stages of 64. The last stage of processing element of the 
odd rows in the lower half is directly connected to the adder 32 and an adder 34. The last stage of processing element 
of the even rows in the lower half is directly connected to the adders 33 and 34. The outputs of the adders 32 and 33 
are connected to an adder 35. 

is In this embodiment, in spite of decreasing the number of stages in each FIFO unit, three matching criteria needed 

for MPEG2 i e 16 x 16, 16 x 8 (top) and 16 x 8 (bottom) for the frame structure can be obtained at the same time. 
Also, three matching criteria, i.e., 16 x 16, 16 x 8 (upper) and 16 x 8 (lower) for the field structure can be obtained at 
the same time. 

Fig 13 is a diagram to explain the supply of the reference block pixel data. The reference block pixel data are 
20 supplied from four reference block pixel data buses while changing the boundary lines, as in the first embodiment. 
Therefore, the detail description is omitted. 

Next, the motion vector estimating apparatus according to the fourth embodiment of the present invention will be 
described below. The basic structure of the motion vector estimating apparatus in the fourth embodiment is the same 
as that of the first embodiment. Therefore, the different point will be described. 
25 Fig. 1 4 shows the structure of the motion vector monitoring unit 4' of the motion vector estimating apparatus in the 

fourth embodiment. Referring now to Fig. 14, the motion vector monitoring unit 4' is composed of the filter 1201 for 
filtering the motion vectors obtained from the block matching unit 3, the search window determining circuit 1202, the 
window data storage unit 1 203, and a bit map storage unit 1 204. The filter 1 201 and the window data storage unit 1 203 
are the same as those in the first embodiment. 
30 The search window determining circuit 1 202 determines estimation history of previously estimated motion vectors 

obtained from the motion vector storage unit 5, determine a search window from the estimation history and stores 
pointers designating this determined search window into the window data storage unit 1 203. Also, the search window 
determining circuit 1202 determines bit-map regions 1501 and 1502 as shown in Fig. 15, and then stores this bit-map 
regions 1 501 and 1 502 in the bit map storage unit 1 204. It should be noted that the filter 1 201 is provided, if necessary. 
35 Each of the above-described pointers are composed of, for instance, an upper left position of each of the rectangular 
reference regions of the search window, a height of each reference region, a width of each reference region, and so on. 

With reference to Fig. 6A, the reference regions 1101 to 1104 which form the search window 1100 can be repre- 
sented to have a rectangular shapes. To the contrary, each of the bit-map regions 1501 and 1502 can be defined in 
units of pixels by free curved lines, as shown by white circles in Fig. 1 5. In this case, a 1 -bit flag data can be added to 
40 each of the pixel data of the reference regions 1101 to 1104 within the search window to be supplied into the block 
matching unit 3. The 1-bit flag data indicates whether the pixel data is valid or invalid. 

Fig. 1 6 is a block diagram illustrating the structure of the processing element. The processing element of the block 
matching circuit in the fourth embodiment has a register ltd for fixedly holding a maximum value, and a selector 14 
for selecting one of the output from the register 1 1 d and the output from the adding element 1 3 in response to a control 
45 signal from the control circuit 10-1 . 

When a pixel data is supplied to the block matching unit 3, the control circuit 10-1 checks the flag data of the pixel 
data. When the flag data indicates that the pixel data is contained in the bit map region 1 501 or 1 502 shown in Fig. 1 5 
and is invalid, the control circuit 10-1 outputs the control signal to the processing element for the invalid pixel data to 
be supplied such that the content of the register I1d is outputted from the output port Do via the selector 14. As a 
so result, the reference block containing the invalid pixel data of the bit map regions is excluded from a region where any 
motion vector is to be detected. In this case, the pixel data may be not read out from the reference picture data storage 
unit 2. Therefore, the calculations of the absolute value of difference and addition may be not performed in the process- 
ing element. Thus, the power consumption can be reduced. 

There are cases that the reference regions are limited in the vicinity of the right end, the left end, the upper end, 
55 and the lower end. Even when the limited reference regions are present, the flag data is added to the pixel data in the 
reference regions of the search window regions, and the block matching process is selectively performed based on 
the flag data of each pixel data. Therefore, in this case, the structure of the block matching circuit 3 is not made complex. 
As described above, even when the data width of the reference region is increased by 1-bit at most, there is no need 
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to change the structure of the block matching unit 3. 

In a motion vector estimating apparatus according to the fourth embodiment of the present invention, the block 
matching process is carried out between a certain current block and a reference block within the search window to 
detect a motion vector. The block matching circuit in the fourth embodiment is structured in a similar manner to Fig. 2, 
1 1 or 12. Also, each of the pixel data having the valid flag is supplied in a similar manner to Figs. 8A and 8B. 

In the embodiment of the present invention, as described above, since the search window can be freely determined, 
the useless calculations performed in the processing elements, and the useless data reading operation from the ref- 
erence picture data storage unit 2 are decreased. As a result, the power consumption can be lowered. 

Fig. 1 7 is a block diagram illustrating the structure of the motion vector estimating apparatus according to the fifth 
embodiment of the present invention. Referring now to Fig. 17, the motion vector estimating apparatus in the fifth 
embodiment is composed of a temperature detecting unit 6, a load detecting circuit 7, and a power supply voltage 
detecting circuit 8, in addition to the structure shown in Fig. 1. In this embodiment, the motion vector monitoring unit 
4' has the bit map storage unit 1204, like the fourth embodiment. 

As described in the first embodiment, the block matching circuit is realized on a semiconductor chip. The temper- 
ature detecting circuit 6 detects a temperature of the chip on which the block matching unit 3 is formed. Then, the 
temperature detecting circuit 6 generates a search window restriction data based on the detecting result to supply to 
the motion vector monitoring unit 4\ 

The load detecting circuit 7 detects a system load of the block matching unit 3 such as the number of times of the 
block matching process or an amount of data to be supplied to the block matching unit 3. Then, the load detecting 
circuit 7 generates a search window restriction data based on the detecting result to supply to the motion vector mon- 
itoring unit 4V 

The power supply voltage detecting circuit 8 detects the power supply voltage of the motion vector estimating 
apparatus and generates a search window restriction data based on the detecting result to supply to the motion vector 
monitoring unit 4'. 

In this manner, the motion vector monitoring unit 4' determines the search window based on not only the motion 
vector estimation history but also at least one of the temperature, the system load and the power supply voltage. For 
example, when the detected temperature is higher than a reference temperature, when the detected system load is 
heavier than a reference load, or when the power supply voltage is lower than a reference voltage, the search window 
may be determined to have a smaller size in addition to change of the shape and position. 

In this manner, the shape, size and position of the search window can be adaptively determined based on the 
estimation history of the previous motion vectors and the detected temperature, load or voltage. Then, the block match- 
ing process is carried out between the current block and each of the reference blocks within the search window to 
detect a motion vector. 

Also, in the present invention, the size, shape and position of the search window may be adaptively determined 
based on system load such as the load of the block matching unit 3, the chip temperature of the block matching unit 
3, or the power supply voltage. In this case, when the system load is heavy, this implies that the process required for 
the entire system as the motion vector process is large. Therefore, the size, shape or position of the search window 
is changed. * 

Since the size, shape, or position of the search window is adaptively determined based on the above-described 
system load, no excessive system load is given to the processing capability of the motion vector estimating apparatus. 
In other words, for instance, when the chip temperature is increased, the search window is made narrower. In this 
case, since the search window becomes narrower, the calculation amount by the block matching unit 3 is restricted 
and heat dissipation of the LSI chip is restricted. Similarly, when the power supply voltage from a power source such 
as a cell used to drive the motion vector estimating apparatus is decreased, the search window may be conceivably 
narrowed in accordance with the detected power supply voltage. Thus, in a portable terminal equipment by using a 
cell, the operation time thereof can be extended. 

In the present invention, a motion vector of a frame previous to the current frame by 1 , a motion vector of a field 
previous to the current field by 1, a motion vector of the same frame as that of the current reference region, and a 
motion vector of the same filed as that of the current reference region may be employed as the estimation history of 
the previously estimated motion vectors. 

Also, while utilizing features of pixel data of a reference region, for example, image data related to sports and 
image data related to movies may be referred to for determining the estimation history of motion vectors. 

In the conventional motion vector estimating method, while the same position as the current block within either 
the frame or the field is used as the center, the reference region is determined. 

On the other hand, in the embodiments of the present invention, the position of the search window may be adap- 
tively varied based on the above-described estimation history of motion vectors. That is to say, referring to the estimation 
history of the motion vectors, when the average value of this motion vectors is directed to a predetermined direction, 
the center position of the search window can be moved to this direction. In this case, although the search window is 
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made narrower, it is possible to detect the correct motion vector. 

Also, simitar to the estimation history of motion vectors, either the shape or size of the search window may be 
conceivably changed with reference to the load of the motion vector estimating apparatus, the power supply voltage 
and the chip temperature. In other words, for instance, while monitoring a temperature of an LSI chip on which the 
5 motion vector estimating apparatus is mounted, when this temperature is increased, the reference region is made 
narrower. In this case, since the search window becomes narrower the calculation amount by the LSI may suppress 
heat dissipation of the LSI. 

Similarly, while monitoring a remaining power amount of a power source such as a cell used to drive the motion 
vector estimating apparatus, the search window may be conceivably narrowed in accordance with this remaining power 

w amount. In this case, for instance, in a portable terminal by using the cell, the operation time thereof can be extended. 

Furthermore, in the embodiment of the present invention, the search window determining circuit 1202 may be 
realized by a microprocessor or the like. As a consequence, the search window determining circuit 1 202 may be mount- 
ed on another LSI This is because the changing operation of the shape, size and/or position of the search window 
may be once carried out within 1 frame, 1 field, or a certain time period. Therefore, the process time required for 

is determining the search window does not impede the overall operation. In this case, since the algorithm for determining 
the search window can be programmed, the search window can be more flexibly determined. For example, in the case 
that certain image data is changed from the sports image into the movie image, this image change is determined by 
a human operator, so that the search window may be changed. 

As described above, according to the present invention, the below-mentioned effects can be achieved. 

20 First, in the present invention, the motion vector can be detected in high precision under low power consumption. 

This is because the dimension, shape and/or position of the search window can be adaptively determined based on 
previously estimated motion vectors. Also, the dimension, shape and/or position of the search window can be adaptively 
determined based on the load of the motion vector estimating apparatus such as temperature of the motion vector 
estimating apparatus, the power supply voltage and the number of time of block matching process. 

25 .Second, in the present invention, when the block matching process is performed in the parallel manner, this parallel 

degree is not decreased. As a result, according to the present invention, the block matching process can be performed 
within a short time, and the power consumption can be reduced. Also, according to the present invention, even when 
the search window is made small, the motion vector can be detected in high precision. This is because the search 
window can be flexibly determined and divided into a plurality of reference regions. Also, it is because the data supply 

30 schedule is devised. 

Third, in the present invention, the chip area for the block matching circuit can be saved and the power consumption 
can be reduced. This is because the number of buses required to enter the reference block pixel data is mall. Also, it 
is because the entry of the reference block pixel data is scheduled in such a manner that the number of stage in the 
FIFO units can be reduced. 

35 Fourth, in the present invention, even when the block matching process is carried out over a plurality of reference 

regions, it is possible to execute the block matching process such that the pipeline of the block matching circuit is 
always filled. That is, the parallel degree of the process is not decreased. As a consequence, in accordance with the 
present invention, even when the reference region is subdivided into the plural reference regions, there is no risk that 
the process is increased and the power consumption is increased. 

40 This is because the reference block pixel data of the search window is supplied via the two stage of selectors to 

the bus wiring line connected to the processing elements. 

Fifth, in the present invention, when the minimum value or likelihood is calculated from the summation of the 
difference absolute values, increasing in the total number of the wiring lines from all the processing elements to the 
minimum value detecting circuit can be suppressed. As a consequence, according to the present invention, the motion 

45 vector estimating apparatus can be realized with the smaller chip area and with reduced power consumption. This is 
because the local wiring lines are employed in the wiring lines used to connect the processing elements. 

Sixth, in the present invention, the motion vectors can be simultaneously acquired in response to a plurality of 
prediction modes of MPEG-2 (ISO/iEC 13818-2). In this case, according to the present invention, there are only the 
local wiring lines among the wiring lines connected from the processing elements, and the chip area can be reduced. 

50 This is because the processing elements are divided into four sorts of processing elements as the respective processing 
elements belonging to the even row, the odd row, the upper half row, and the lower half row. 



Claims 

55 

1. A motion vector estimating apparatus comprising: 

a current picture storage unit (1) which stores image data of a current picture; 
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a reference picture storage unit (2) which stores image data of a reference picture; 

a search window determining unit (4) which determines estimation history from previously estimated motion 
vectors, and determines a search window based on said estimation history, wherein said search window is 
composed of rectangular reference regions; 

a block matching circuit (300-1 , 300-2, 300-3) for performing a block matching process to a current block and 
each of reference blocks of said search window to determine a motion vector; and 

a control circuit (10-1, 10-2) which reads out the image data of said current block from said current picture 
storage unit to supply said block matching circuit, which reads out the image data of each of said reference 
blocks of said search window from sard reference picture storage unit to supply to said block matching circuit, 
and which controls said block matching circuit such that said block matching process is performed to said 
current block and said each reference block of said search window. 

2. A motion vector estimating apparatus according to claim 1 , wherein at least one of a shape of said search window 
a size of said search window and a position of said search window is determined based on said estimation history. 

3. A motion vector estimating apparatus according to claim 2, wherein said search window determining unit deter- 
mines whether a number of previously estimated motion vectors exceeds a predetermined number, and neglects 
the previously estimated motion vectors to determine said size and shape of said search window, when it is de- 
termined that the number of previously estimated motion vectors is less than the predetermined number. 

4. A motion vector estimating apparatus according to claim 2 or 3, wherein said search window determining unit 
determines whether a number of previously estimated motion vectors exceeds a predetermined number, produces 
a 2-dimensional map in which each of said previously estimated motion vectors is mapped based on its horizontal 
value and the vertical value, when it is determined that the number of previously estimated motion vectors exceeds 
the predetermined number, and determines said size and shape of the search window based on said 2-dimensional 
map. 

5. A motion vector estimating apparatus according to any one of claim 2 through 4, wherein said search window 
determining unit calculates average vector of said previously estimated motion vectors and determines said posi- 
tion of said search window using said average vector. 

6. A motion vector estimating apparatus according to any one of claim 1 through 5, further comprising a load detecting 
circuit (7) for detecting either one of a load of said motion vector estimating apparatus or a load of said block 
matching circuit, and 

wherein said search window determining unit determines said search window based on said estimation his- 
tory and the detected load. 

. A motion vector estimating apparatus according to any one of claims 1 through 6, further comprising a voltage 
detecting circuit (8) for detecting a power supply voltage, and 

wherein said search window determining unit determines said search window based on said estimation his- 
tory and the detected power supply voltage. 

. A motion vector estimating apparatus according to any one of claims 1 through 7, further comprising a temperature 
detecting circuit (6) for detecting a temperature of a chip on which said block matching circuit is installed, and 

wherein said search window determining unit determines said search window based on said estimation his- 
tory and the detected temperature. 

. A motion vector estimating apparatus according to any one of claims 1 through 8, wherein said search window 
determining unit determines at least an unuseful region in said search window in unit of pixels based on said 
estimation history, and set a flag to each of pixels of said unuseful region such that said block matching process 
to said reference block which includes any pixel of said unuseful region is made invalid based on the flag of the 
pixel of said unuseful region. 

0. A motion vector estimating apparatus according to claim 9, wherein said block matching circuit includes processing 
elements (21) of M columns and N rows (M and N are positive integers) provided such that a likelihood between 
said current block and each of said reference blocks is calculated through pipeline processing. 

1. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit further includes 
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a hierarchical selector structure (2211 to 2214, 2221 to 2224, 23) such that each of said processing elements is 
supplied with one of said pixel data of each of said reference blocks. 

12. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit further includes 
(M * N) reference data buses 206-1 to 206-4, 207-1 to 207-4), which are provided for said M columns and said N 
rows, and N current data buses provided for said N rows, respectively. 

13. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit comprises: 

said processing elements (21 ) of M columns and N rows (M and N are positive integers) , wherein said process- 
ing elements are cascade-connected in each of said N rows, wherein each of said processing elements cal- 
culates an absolute value of a difference between a corresponding pixel data of a current block and one pixel 
data of each of reference blocks, and adds the absolute value to an output from a previous stage of processing 
element to output the adding result to a next stage of processing element, and wherein said N rows are cas- 
cade-connected to allow pipeline processing; 

M reference data column buses (207-1 to 207-4) provided for said M columns, respectively; 
N reference data row buses (206-1 to 206-4) provided for said N rows, respectively; 

(M x N) selectors (22), which are respectively provided for said processing elements to allow one of the pixel 
data on the corresponding one of said M reference data column buses and the pixel data on the corresponding 
one of said N reference data row buses to said corresponding processing element as the reference block pixel 
data; and 

a current block pixel data bus (208-1 to 208-4) for transferring current block pixel data to each of said processing 
elements. 

14. A motion vector estimating apparatus according to claim 1 3, further comprising: 

a reference block data bus group (301 to 304) for transferring pixel data of said reference blocks in units of 
rows as sequences of pixel data; 

a first group of selectors (2211 to 2214) which are respectively provided for said M reference data column 
buses, and each of which selects one pixel data of one of said sequences to supply to the corresponding 
reference data column bus; and 

a second group of selectors (2221 to 2224) which are respectively provided for said N reference data row 
buses, and each of which selects one pixel data of one of said sequences to supply to the corresponding 
reference data row bus. 

15. A motion vector estimating apparatus according to claim 1 3 or 14, wherein each of said processing elements 
latches the corresponding current block pixel data once the corresponding current block pixel data is supplied via 
said current block pixel data bus. 

1 6. A motion vector estimating apparatus, according to any one of claims 1 3 through 1 5, wherein each of said process- 
ing elements is synchronized with a clock signal to perform the calculation of the absolute value of the difference 
and the addition and to output the addition result to the next stage of processing element 

17. A motion vector estimating apparatus according to claim 16, wherein said processing elements calculates a like- 
lihood of one of said reference blocks in a time period from a first time to (M x N) time, and calculates a likelihood 
of a different one of said reference blocks in a time period from a second time to (M x N + 1 ) time. 

18. A motion vector estimating apparatus according to any one of claims 13 through 17, wherein said current block is 
composed of pixels of M columns and N rows. 

19. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
elements are cascade-connected in each of said N rows and said N rows are cascade-connected. 

20. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
elements are cascade-connected in each of said N rows (N is an even number), wherein odd-numbered ones of 
said N rows in an upper half are cascade-connected via first FIFO units, each of which is provided for two adjacent 
odd-numbered rows, wherein even-numbered ones of said N rows in said upper half are cascade-connected via 
second FIFO units, each of which is provided for two adjacent even-numbered rows, wherein odd-numbered ones 
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of said N rows in a iower half are cascade-connected via third FIFO units, each of which is provided for two adjacent 
odd-numbered rows, and wherein even-numbered ones of said N rows in said lower half are cascade-connected 
via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows, and 

wherein said block matching circuit further includes fifth and sixth FIFO units connected to iast stages of 
processing elements in said cascade connections for the odd-numbered rows and the even-numbered rows in 
said upper half, respectively. 

21. A motion vector estimating apparatus according to claim 20, wherein a number of stages in each of said first to 
fourth FIFO units is M, and a number of stages in said fifth FIFO unit is (M x N/2 + M), and a number of stages in 
said sixth FIFO unit is (M x N/2). 

22. A motion vector estimating apparatus according to claim 20, further comprising; 

a fist adder (31 ) adding an output of said fifth FIFO unit and an output of said sixth FIFO unit; 

a second adder (32) adding the output of said fifth FIFO unit and an output of the processing element in a last 

stage of odd-numbered cascade connection in said lower half; 

a third adder (33) adding the output of said sixth FIFO unit and an output of the processing element in a last 
stage of even-numbered cascade connection in said lower half; 

a fourth adder (34) adding the output of the processing element in the last stage of odd-numbered cascade 
connection in said lower half and the output of the processing element in the last stage of even-numbered 
cascade connection in said lower half; and 

a fifth adder (35) adding an output of said second adder and an output of said third adder. 

23. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
elements are cascade-connected in each of said N rows (N is an even number), wherein ones of said N rows in 
an upper half are cascade-connected via first FIFO units, each of which is provided for two adjacent rows, and 
wherein ones of said N rows in a lower half are cascade-connected via second FIFO units, each of which is provided 
for two adjacent rows, and 

wherein said block matching circuit further includes a third FIFO units connected to the processing element 
in a last stage of cascade connection in said upper half. 

24. A motion vector estimating apparatus according to claim 23, wherein a number of stages in each of said first and 
second FIFO units is M, and a^number of stages in said third FIFO unit is (M x N/2). 

25. A motion vector estimating apparatus according to claim 23, further comprising: 

a adder (35) adding an output of said third FIFO unit and an output of the processing element in a last stage 
of cascade connection in said lower half. jr 

26. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
elements are cascade-connected in each of said N rows (N is an even number), wherein odd-numbered ones of 
said N rows are cascade-connected via first FIFO units, each of which is provided for two adjacent odd-numbered 
rows, wherein even-numbered ones of said N rows are cascade-connected via second FIFO units, each of which 
is provided for two adjacent even-numbered rows, and 

27. A motion vector estimating apparatus according to claim 26, wherein a number of stages in each of said first and 
second FIFO units is M. 

28. A motion vector estimating apparatus according to claim 26, further comprising: 

a adder (35) adding an output of the processing element in a last stage of odd-numbered cascade connection 
and an output of the processing element in a last stage of even-numbered cascade connection. 

29. A method of estimating a motion vector, comprising the steps of: 

determining estimation history from previously estimated motion vectors; 

determining a search window based on said estimation history, wherein said search window is composed of 
rectangular reference regions; and 

performing a block matching process to a current block and each of reference blocks of said search window 
to determine a motion vector. 
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30. A method according to claim 29, wherein at least one of a shape of said search window, a size of said search 
window and a position of said search window is determined based on said estimation history. 

31 . A method according to claim 30 t wherein said step of determining estimation history includes determining whether 
5 a number of previously estimated motion vectors exceeds a predetermined number, and 

wherein said step of determining a search window includes neglecting the previously estimated motion vec- 
tors to determine said size and shape of said search window based on said current block, when it is determined 
that the number of previously estimated motion vectors is less than the predetermined number. 

10 32. A method according to claim 30 or 31 , wherein said step of determining estimation history includes: 

determining whether a number of previously estimated motion vectors exceeds a predetermined number; and 
producing a 2-dimensional map in which each of said previously estimated motion vectors is mapped based 
on its horizontal value and the vertical value, when it is determined that the number of previously estimated 
motion vectors exceeds the predetermined number, and 

wherein said step of determining a search, window includes determining said size and shape of the search 
window based on said 2-dimensional map. 

A method according to any one of claim 30 through 32, wherein said step of determining estimation history includes 
calculating an average vector of said previously estimated motion vectors, and 

wherein said step of determining a search window includes determining said position of said search window 
using said average vector. 

A method according to any one of claims 29 through 33, further comprising the steps of: 

detecting either one of a load of said motion vector estimating apparatus or a load of said block matching circuit; 
detecting a power supply voltage; and 

detecting a temperature of a chip on which said block matching circuit is installed, and 
wherein said step of determining a search window includes determining said search window based on said 
estimation history and at least one of the detected load, the detected power supply voltage and the detected 
temperature. 

35. A method according to any one of claims 29 through 34, wherein said steps of:^ 

35 determining a search window includes determining at least an unuseful region in said search window in unit 

of pixels based on said estimation history; and 

setting a flag to each of pixels of said unuseful region such that said block matching process to said reference 
block which includes any pixel of said unuseful region is made invalid based on the flag of the pixel of said 
unuseful region. 

40 

36. A method according to claim 35, wherein said step of performing a block matching process includes performing 
said block matching process in a pipeline processing using processing elements of M columns and N rows (M and 
N are positive integers). 

45 37. A method according to claim 36, wherein said step of performing said block matching process in a pipeline process- 
ing includes: 

in each of said processing elements, calculating an absolute value of a difference between a corresponding 
pixel data of said current block and one pixel data of each of said reference blocks; 
50 adding the absolute value to an output from a previous stage of processing element; and 

outputting the adding result to a next stage of processing element. 

38. A method according to claim 34, wherein said calculating step includes, in each of said processing elements, 
latching the corresponding current block pixel data once the corresponding current block pixel data is supplied via 

55 said current block pixel data bus. 

39. A method according to claim 37, wherein said step of performing said block matching process in a pipeline process- 
ing includes, in each of said processing elements, performing the calculation of the absolute value of the difference 




33. 

20 



34. 



25 



30 



20 



BN8DOCID: <EP_JD83164aAaJL> 



EP 0 831 642 A2 



and the addition and to output the addition result to the next stage of processing element in synchronous with a 
clock signal. 

40. A method according toclaim 39, wherein said step of performing said block matching process in a pipeline process- 
5 ing includes: 

calculating a likelihood of one of said reference blocks in a time period from a first time to (M x N) time; and 
calculating a likelihood of a different one of said reference blocks in a time period from a second time to (M x 
N + 1 ) time. 
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Fig. 6 A 
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